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Abstract. We consider a crucial aspect of self-organization of a sensor 
network consisting of a large set of simple sensor nodes with no location 
hardware and only very limited communication range. After having been 
distributed randomly in a given two-dimensional region, the nodes are re- 
quired to develop a sense for the environment, based on a limited amount 
of local communication. We describe algorithmic approaches for deter- 
mining the structure of boundary nodes of the region, and the topology of 
the region. We also develop methods for determining the outside bound- 
ary, the distance to the closest boundary for each point, the Voronoi 
diagram of the different boundaries, and the geometric thickness of the 
network. Our methods rely on a number of natural assumptions that are 
present in densely distributed sets of nodes, and make use of a combi- 
nation of stochastics, topology, and geometry. Evaluation requires only 
a limited number of simple local computations. 
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1 Introduction 

In recent time, the study of wireless sensor networks (WSN) has become a rapidly 
developing research area that offers fascinating perspectives for combining tech- 
nical progress with new applications of distributed computing. Typical scenarios 
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involve a large swarm of small and inexpensive processor nodes, each with limited 
computing and communication resources, that are distributed in some geomet- 
ric region; communication is performed by wireless radio with limited range. As 
energy consumption is a limiting factor for the lifetime of a node, communica- 
tion has to be minimized. Upon start-up, the swarm forms a decentralized and 
self-organizing network that surveys the region. 

From an algorithmic point of view, the characteristics of a sensor network 
require working under a paradigm that is different from classical models of com- 
putation: Absence of a central control unit, limited capabilities of nodes, and 
limited communication between nodes require developing new algorithmic ideas 
that combine methods of distributed computing and network protocols with tra- 
ditional centralized network algorithms. In other words: How can wc use a limited 
amount of strictly local information in order to achieve distributed knowledge 
of global network properties? 

This task is much simpler if the exact location of each node is known. Com- 
puting node coordinates has received a considerable amount of attention. Unfor- 
tunately, computing exact coordinates requires the use of special location hard- 
ware like GPS, or alternatively, scanning devices, imposing physical demands on 
size and structure of sensor nodes. A promising alternative may be continuous 
range modulation for measuring distances between nodes, but possible results 
have their limits: The accumulated inaccuracies from local measurements tend to 
produce significant errors when used on a global scale. This is well-known from 
the somewhat similar issue of odometry from the more progressed field of robot 
navigation, where much more powerful measurement and computing devices are 
used to maintaining a robot's location, requiring additional navigation tools. See 
SB03j for some examples and references. Finally, computation and use of exact 
coordinates of nodes tends to be cumbersome, if high accuracy is desired. 

It is one of the main objectives of this paper to demonstrate that there may 
be a way to sidestep many of the above difficulties: Computing coordinates is 
not an end in itself. Instead, some structural location aspects do not depend 
on coordinates. An additional motivation for our work is the fact that location 
awareness for sensor networks in the presence of obstacles (i.e., in the presence 
of holes in the surveyed region) has received only little attention. 

One key aspect of location awareness is boundary recognition, making sen- 
sors close to the boundary of the surveyed region aware of their position and 
letting them form connected boundary strips along each verge. This is of major 
importance for keeping track of events entering or leaving the region, as well as 
for communication purposes to the outside. Neglecting the existence of holes in 
the region may also cause problems in communication, as routing along shortest 
paths tends to put an increased load on nodes along boundaries, exhausting their 
energy supply prematurely; thus, a moderately-sized hole (caused by obstacles, 
by an event, or by a cluster of failed nodes) may tend to grow larger and larger. 

Beyond discovering closeness to the boundary, it is desirable to determine 
the number and structure of boundaries, but also more advanced properties 
like membership to the outer boundary (which separates the region from the 



unbounded portion of the outside world) as opposed to the inner boundaries 
(which separate the swarm from mere holes in the region). Other important 
goals are the recognition of nodes that are well-protected by being far from the 
boundary, the recognition of nodes that are on the watershed between different 
boundaries (i.e., the Voronoi subdivision of the region), and the computation of 
the overall geometric thickness of the region, i.e., the size of the largest circle 
that can be fully inscribed into the region. 

We show that based on a small number of natural assumptions, a consider- 
able amount of location awareness can indeed be achieved in a large swarm of 
sensor nodes, in a relatively simple and self-organizing manner after deployment, 
without any use of location hardware. Our approach combines aspects of random 
distributions with natural geometric and topological properties. 

Related Work. There are many papers dealing with node coordinates; for 
an overview, consider the cross-references for some of the following papers. A 
number of authors use anchors with known coordinates for computing node lo- 



calization, in combination with hop count. See DPG01 SRL02 SK02| , |CHH0l| 
uses only distances between nodes for building coordinates, based on triangu- 
lation. PBDT03 presents a fully distributed algorithm that builds coordinate 
axes based on a by a near/far-metric and runs a spring embedder. Many related 
graph problems are NP-hard, as shown by BK98 for unit disk graph recognition, 
and by AGY04] for the case of known distances to the neighbors. 

On the other hand, holes in the environment have rarely been considered. 
This is closely related to the fc-coverage problem: Decide whether all points in 
the network area are monitored by at least k nodes, where k is given and fixed. 
In this context, a point is monitored by a node, if it is within the node's sensing 
area, which in turn is usually assumed to be a disk of fixed size. By setting the 
sensing range to half the communication range, 1-coverage becomes a decision 
problem for the existence of holes. In |HT03j . an algorithm for fc-coverage that 
can be distributed is proposed. Unfortunately, it requires precise coordinates for 
all nodes. In |FCjG04| . holes are addressed with greedy geographic routing in 
mind: Nodes where data packets can get stuck are identified using a fully local 
rule, allowing identification of the adjacent hole by using a distributed algorithm. 
Again, node coordinates must be known for both detection rule and bypassing 
algorithm to work. |Bea03| considers detection of holes resulting from failing 
nodes. It proposes a distributed algorithm that uses a hierarchical clustering to 
find a set of clusters that touch the failing region and circumscribe it. 

Our Results. We show that distributed location awareness can be achieved 
without the help of location hardware. In particular: 

— We describe how to recognize the nodes that are near the boundary of the 
region. The underlying geometric idea is quite simple, but it requires some 
effort on both stochastics and communication to make it work. 

— We extend our ideas to distinguish the outside boundary from the interior 
boundaries. 

— We describe how to compute both boundary distance for all nodes and overall 
region thickness. 



— We sketch how to organize communication along the boundaries. 

— We describe how to compute the Voronoi boundaries that are halfway be- 
tween different parts of the boundary. 

The rest of this paper is organized as follows. In Section [21 we give some 
basic notation and state our underlying model assumptions. In Section we 
describe how to obtain an auxiliary tree structure that is used for computing 
and distributing global network parameters. Section 0] gives a brief overview of 
probabilistic aspects that are used in the rest of the paper to allow topology 
recognition. Section [5] describes how to perform boundary recognition, while 
Section gives a sketch of how to compute more advanced properties. Section[7] 
describes implementation issues and shows some of our experiments. Finally, 
Section [S] discusses the possibilities for further progress based on our work. 

2 Preliminaries 

Swarm and geometry. In the following, we assume that the swarm consists 
of a set V of nodes, and the cardinality of V is some large number n. Each node 
v € V has a globally unique ID (for simplicity, denoted by v) of storage size 
O(logn), and coordinates x v that are unknown to any node. All node positions 
are contained in some connected region A nct C R 2 , described by its boundary 
elements. In computational geometry, it is common to consider regions that 
are non-degenerate polygons, bounded by k disjoint closed polygonal curves, 
consisting of a total of s line segments, meeting pairwise in a total of s corners. 
Each of the k boundary curves separates the interior of A nct from a connected 
component of R 2 \ A net . The unique boundary curve separating A nct from the 
infinite component of R 2 \A nct is called the outside boundary; all other boundaries 
are inside boundaries, separating A net from holes. Thus, the genus of A nct is 
k — 1. As we do not care about the exact shapes (and an explicit description of 
the boundary is neither required nor available to the nodes), we do not assume 
that A nc t is polygonal, meaning that curved (e.g., circular) instead of linear 
boundary pieces are admissible; we still assume that it consists of s elementary 
curves, joined at s corners, with a total number of k boundaries. 
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Fig. 1. Geometric parameters. 



As narrow bottlenecks in a region can lead to various computational prob- 
lems, a standard assumption in computational geometry is to consider regions 
with a lower bound on the fatness of the region; for a polygonal region, this 
is defined as the ratio between d m i n , the smallest distance between a corner 
of the region and a nonadjacent boundary segment, and rf max , the diameter of 
the region. When dealing with sensor networks, the only relevant parameter for 
measuring distances is the communication radius, R. Thus, we we use a similar 
parameter, called feature size, which is the ratio between d m i n and R. For the 
rest of this paper, we assume that feature size has a lower bound of 2. (This 
technical assumption is not completely necessary, but it simplifies some matters, 
which is necessary because of limited space.) In addition, we assume that angles 
between adjacent boundary elements are bounded away from and from 2ir, 
implying that there are no sharp, pointy corners in the region. 

Node distribution. A natural scenario for the deployment of a sensor net- 
work is to sprinkle a large number of small nodes evenly over a region. Thus, we 
assume that the positioning of nodes in the region is the result of a random pro- 
cess with a uniform distribution on A net . We also assume reasonable density; in 
a mathematical sense, this will be made clear further down. In a practical sense, 
we assume that each node can communicate with at least 100 other nodes, and 
the overall network is connected. 

A(-) denotes a volume function (i.e., the Lebesgue measure) on R 2 , therefore 
< X(A DCt ) < oo. For simplicity, A G := ttR 2 denotes the area of the disk with 
radius R. 

Using the notation V(A) := {v G V : x v G A}, the expected number of nodes 
to fall into an area A C A nct is therefore 

n\V(A)\]=n-^- (1) 

Therefore, a node v G V that is not close to the network area's boundary, 
i.e., Br(x v ) C A net has an estimated neighborhood size of 

„:=E[\N(v)\] = (n-l)-^- (2) 

Here, B r (x) denotes the ball around x with radius r. 

Node communication. Nodes can broadcast messages that are received by 
all nodes within communication range. The cost of broadcasting one message of 
size m is assumed to be 0(m); e.g., any message containing a sender ID incurs 
a cost of O(logn). 

We assume that two nodes u ^ v G V can communicate if, and only if, 
they are within distance R. This is modeled by a set of edges, i.e., uv G E : 
\\x u — x v \\ ^ R, where ||-|| denotes the Euclidean norm. The set of adjacent 
nodes of v G V is denoted by N(v), and does not include v itself. Such a graph 
is known under many names, e.g., geometric, (unit) disk, or distance graph. The 
maximum degree is denoted by A := max„ e v \N(v)\. 



3 Leader Election and Tree Construction 



A first step for self-organizing the swarm of nodes is building an auxiliary struc- 
ture that is used for gathering and distributing data. The algorithms that are 
presented in Section [S] only work if certain global network parameters are known 
to all nodes. By using a directed spanning tree, nodes know when the data ag- 
gregation phase terminates and subsequent algorithmic steps may follow. This is 
in contrast to other methods like flooding, where termination time is unknown. 
The issue of leader election has been studied in various contexts; see BKKM96 
for a good description. In principle, protocols for leader election may be used for 
our purposes, as they implicitly construct the desired tree; however, using node 
IDs (or pre-assigning leadership) does offer some simplification. 

An alternative to leader election is offered by the seminal paper [UHS83 
dealing with distributed and emergent search tree construction. It builds a min- 
imum spanning tree in a graph with n nodes in a distributed fashion, using 
only local communication. Complexities are 0(n log 2 n) time from the first mes- 
sage by a node to completed information at the constructed root, and 0(log 2 n) 
transmissions per node, consisting of O(logn) messages, each sized O(logn). 

In the following, we will use this auxiliary tree; in particular, we may assume 
that each node knows N{v) and n, and it is able to use the tree for requesting 
and obtaining global data. Note that the tree is only used for bootstrapping the 
network; it may be replaced by a more robust structure at a later time. 

4 Probabilistic Aspects 

The idea for recognizing boundary nodes is relatively simple: Their communica- 
tion range intersects a smaller than average portion of the region, and thus N(v) 
is smaller than in other parts. However, a random distribution of nodes does not 
imply that the size of N(v) is an immediate measure for the intersected area, 
as there may be natural fluctuations in density that could be misinterpreted as 
boundary nodes. In order to allow dealing with this difficulty, we introduce a 
number of probabilistic tools. 

Recall that Chebychev's inequality shows that for a binomial distribution for 
n events with probability p, i.e., for a bin (rt,p)-distributed random variable X, 
and a < 1 

Pr[A ^ anp] ^ — • const — > 0(n — > oo) (3) 
n 

holds. We exploit this fact to provide a simple local rule to let nodes decide 
whether they are close to the boundary cM ne t- Let a < 1 be fixed, and let 

D = D{a) :={veV: \N(v)\ < afi} . (4) 

Theorem 1. Let v be a node whose communication range lies entirely in A nct . 
Then v ^ D with high probability. 



Proof. This follows directly from ©, as 

Pr[|JV(«)|<a/i]=Pr[|V(flfl(iB w ))\H|<aM]->0 (n -► oo) . (5) 

□ 

Theorem 2. Let x G dA uct be on the network area's boundary. Let e > 0. 
Assume a > j-\(P>R +e (x) (~l A nct ). Then, with high probability, there is a node 
v G D with \\x — x v || ^ e. 

Proof. Let A e (x) := B e (x) Pi A no t be the area where w is supposed to be. Then 
X(A £ (x)) > by our assumption on feature size. The probability that there is no 

node in A e {x) equals the probability for a bin (n, ^isfell ^ -distributed variable 

to become zero, i.e., 

Pr[|y(A £ (x))| = 0]= (l-^f^)" -+0 (n->oo). (6) 

On the other hand, the probability that a node u in A e (x) has more than a/i 
neighbors is 

Pr[|iV(u)| > afi] = Pi[\V(A E (x))\ > u[i + 1 | u exists] 
^ Pr[\V(B R+e {x))\ > an+1] 
— > (n — » oo), because aA > A(Aij+ e (x)) . 

Together, we get 

Pr[3u G V,x v G A e (x) : |iV(u)| < a/j,} 
= 1 - Pv[V(A e (x)) = 0] - Pr[Vw G F(A £ (a;)) : \N(v)\ > af i \ V(A £ {x)) ^ 0] 
^ 1 - Pv[V(A e (x)) = 0] - Pr[|AT( w )| > afi \ v G F(A e (x))] 
— > 1 (n — > oo) , 

which proves the claim. □ 

The assumed lower bound on a can be derived from natural geometric prop- 
erties. For example, if all angles are between ^ and then for a > 0.75 the 
condition holds for a reasonably small e. We conclude that D reflects the bound- 
ary very closely. It can be determined by a simple local rule, namely checking 
whether the number of neighbors falls below afi. However, this requires that all 
nodes know the value of a\i. The next Section focuses on this key issue by 
providing distributed methods for estimating /i and a. 

5 Boundary Computation 

As described in the previous section, the key for deciding boundary membership 
is to obtain good estimates for the average density /i of fully contained nodes, 
and determining a good threshold a. In the following Subsection l5.1l we derive a 
method for determining a good value for fi. Subsection 15 . 21 gives an overview of 
the resulting distributed algorithm, if a has been fixed. The final Subsection 15. 31 
discusses how to find a good value for a. 



5.1 Determining Unconstrained Average Node Degree 

Computing the overall average neighborhood size can be performed easily by 
using the tree structure described above. However, for computing /x, we need 
the average over unconstrained neighborhoods; the existence of various pieces of 
boundary may lower the average, thus resulting in wrong estimates. 

On the other hand, it is not hard to determine the maximum neighborhood 
size A. As was shown by |AR97| . the ratio of maximum to average degree in the 
unit square intersection graph of a set of n random points with uniform distri- 
bution inside of a large square tends to 1 as n tends to infinity. We believe that 
a similar result can be derived for unit disk intersection graphs. Unfortunately, 
convergence of the ratio is quite slow, and using A as an estimate for \x is not a 
good idea. For our illustrative example with 45,000 nodes (see FigureEl, we get 
A/n w 1.37. 

However, even for very moderate sizes of n, A is within a small constant of 
yU, allowing us to compute the node degree histogram shown in Figure [21 again 
by using the auxiliary tree structure. Clearly, the histogram arises by overlaying 
three different distributions: 

1. The neighborhood sizes of all non-boundary nodes. 

2. The neighborhood sizes of near-boundary nodes, at varying distance from 
the boundary. 

3. The neighborhood sizes of boundary nodes. 

We expect a pronounced binomial distribution around fi for (1.), a uniform 
distribution for values safely between /i/2 and \x for 2., overlayed with a small 
binomial distribution for values under fi/2 for 3., possibly skewed in the presence 
of many nodes near corners of the region. (The latter is not to be expected under 
our geometric assumption of bounded feature size and minimum angle, but could 
be used as an indication of a large number of pointy corners otherwise.) 
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Fig. 2. Node degree histogram 

Obviously, a variety of other conclusions could be drawn from the node degree 
histogram. Here we only use the most common neighborhood size At e st as an 
estimate for /i. In our example, /i « 179.65 and /% s t = 177. The according 
histogram is shown in Figure |21 It resembles the expected shape very closely. 



5.2 Algorithms 



When the auxiliary tree is constructed, its root first queries the tree for A, and 
afterwards for the neighborhood size histogram. Using A, it can quantize the 
histogram to a fixed number of entries while expecting a high resolution. This 
step involves per-node transmissions of 0(1) for queries and O(logn) for the 
responses. On reception of the histogram, the root determines fj, cs t- Assuming 
that a is known, it then starts a network flood to pass the value a/^ C st to all 
nodes. Message complexity for the flood is O(logn). 

A node receiving this threshold decides whether it belongs to D. In this case, 
it informs its neighbors of this decision after passing on the flood. These nodes 
form connected boundaries by constructing a tree as described in Section^ with 
the additional condition that two nodes in D are considered being connected if 
their hop distance is at most 2. 

The root of the resulting tree assigns the boundary a unique ID, e.g., its 
node ID. This ID is then broadcast over the tree. All nodes receiving their ID 
start another network flood. This flood is used such that each node determines 
the hop count to its closest boundary. If it receives messages informing it of 
two different boundaries at roughly the same distance, it declares itself to be a 
Voronoi node. 

In addition, the boundary root attempts to establish a one-dimensional co- 
ordinate system in the boundary by sending a message token. The recipient of 
this token chooses a successor to forward the token to, which has to acknowledge 
this choice. Of the possible successors, i.e., not explicitly excluded nodes, the one 
having the smallest common neighborhood with the current token holder is cho- 
sen. The nodes receiving the token passing message without being the successor 
declare themselves as excluded for futher elections. After traveling a few hops, 
the boundary root gets prioritized in searching for the token's next hop, thereby 
closing the token path and forming a closed loop through the boundary. This 
path can then be used as axis for the one-dimensional coordinates. 

5.3 Determining a Good Threshold a 

Our algorithms depend on a good choice of the area-dependent parameter a, 
which should be as low as possible without violating the lower bound from 
Theorem |21 If a bound on corner angles is known in advance, say, 3tt/2 in a 
rectilinear setting, this is easy: For example, choose a slightly larger than 3/4. 
As this may not always be the case, it is desirable to develop methods for the 
swarm itself to determine a useful a. 

For a too small a, no node will be considered part of the boundary. For 
increasing a, the number of connected boundary pieces grows rapidly, until a is 
large enough to allow different pieces of the same boundary to grow together, 
eventually forming the correct set of boundary strips. When further increasing 
a, additional boundaries appear in low-density areas, increasing the number of 
identified boundaries. These boundaries also begin to merge, until eventually a 
single boundary consisting of the whole network is left. 
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Fig. 3. The number of boundary components as a function of a. 

Figure [3] shows that this expected behavior does indeed occur in reality: 
Notice the clear plateau at 4 connected components, embedded between two 
pronounced peaks. 

This shows that computing a good threshold can be achieved by sampling 
possible values of a and keeping track of the number of connected boundary 
components. 

6 Higher-Order Parameters 

Once the network has identified boundary structures, it is possible to make use of 
this structure for obtaining higher-order information. In this section, we sketch 
how some of them can be determined. 

6.1 Detection of Outer Boundary 

One possible way to guess the outer boundary is to hope that the shape of 
boundary curves is not too complicated, which implies that the outside boundary 
is longest, and thus has the largest number of points. An alternative heuristic is 
motivated by the following theorem. See Figure 0] for the idea. 

Theorem 3. Let P be a simple closed polygonal curve with feature size at least 
2R and total Euclidean length £{P), consisting of edges ej, and let <fi be the 
(outside) angle between edges and Let Bi{P) be the set of all points that 
are near P as an inner boundary, i.e., that are outside of P and within distance 
R of P , and let B a (P) be the set of all points that are near P as an outside 
boundary, i.e., that are inside of P and within distance R of P. Then the area 



of B {P) is Ri(P) - T, iPl>0 R2: f + T, iPl <o R2taii (^2 ± ) 7rR2 > while the area °f 
B t {P) is R£(P) + E^o W ~ E v ,>o R 2 tan(^)7ri? 2 . 



Proof. As shown in Figure 01 both B (P) and Bi(P) can be subdivided into a 
number of strips Sj parallel to edges ej of P, and circular segments near vertices 
of P that are either positive (when the angle between adjacent strips is positive, 
meaning there is a gap between the strips) or negative (when the angle is nega- 
tive, meaning that strips overlap.) More precisely, if the angle ifi between edges 



(a) The area of outside 
and inside strips. 



(b) Overlap between 
adjacent strips. 



Fig. 4. Geometry of boundary strips. 

ej and ej+i is positive, we get an additional area of R 2 ^-, while for a negative ipi, 
the overlap is R 2 tan(^-). In total, this yields the claimed area. Our assumption 
on feature size guarantees that no further overlap occurs. □ 

Note that in any case, J^i f% = Making use of this property is possible 
in various ways: It is natural to assume that the area for the exterior boundary 
strip is less than suggested by its length, while it should be larger for all other 
boundaries. This remains true for other kinds of boundary curves using similar 
arguments. 

A straightforward estimate for strip area is given by the number \N(Dj)\, 
while \Dj \ is a natural estimate for the length of the boundary, as the density of 
boundary nodes should be reasonably uniform along the boundary. Near bound- 
ary corners, the actual number of boundary nodes will be higher for convex 
corners (as the threshold of neighborhood size remains valid at larger distance 
from the boundary), and lower for nonconvex corners. Thus it makes sense to 
consider the ratio ^grj^ for all boundary components, as the outside boundary 
can be expected to have lower than average \N(Dj)\ and higher than average 
\Dj\. The component with the lowest such ratio is the most likely candidate for 
being the outside boundary. See Table2]for the values of our standard example. 
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Table 1. Number of boundary nodes and near-boundary nodes for each bound- 
ary component. 



The main appeal of this approach is that the required data is already avail- 
able, so evaluation is extremely simple; we do not even have to determine hop 
count along the boundary. 

It should be noted that our heuristic may produce wrong results if there is 
an extremely complicated inside boundary. This can be fixed by keeping track 
of angles (or curvature) along the boundary; however, the resulting protocols 
become more complicated, and we leave this extension to future work. 



6.2 Using Boundary Distance 

Once all boundaries have been determined, it is easy to compute boundary dis- 
tances for each node by determining a hop count from the boundary. Note that 
this can be done to yield non-integral distances by assigning fractional distances 
to the near-boundary nodes, depending on their neighborhood size. 

This makes it easy to compute the geometric thickness of the region: Compute 
a node with maximum boundary distance. In our standard example, this node 
is located between the three inside boundaries. 



7 Experimental Results 



All our above algorithms have been implemented and tested on different point 



sets. See the following Figure 5(a) for an example with 45,000 nodes and four 
boundaries. The bounding box has a size of 30i? x 30R. Total area of the region is 
786. 9R 2 . Figure 5(b) shows the spatial distribution of neighborhood size. Notice 



the slope near the boundaries. Figure 6(a) shows the identified boundary, near- 
boundary and Voronoi nodes, shown as black dots, gray crosses, black triangles, 
while other interior nodes are drawn as thin gray dots. The total number of 
identified boundary or near-boundary nodes is 11,358, leaving 33,642 nodes as 
interior nodes. Finally, Figure 6(b) shows the assigned structural loops along the 
various boundary strips. 



8 Conclusions 

We have shown that dealing with topology issues in a large and dense sensor 
network is possible, even in the absence of location hardware or the computation 
of coordinates. We hope to continue this first study in various ways. One possible 
extension arises from recognizing more detailed Voronoi structures by making 
use of the shape of the boundary distance terrain: This shape differs for nodes 
that have only one closest line segment in the boundary, as opposed to nodes 
that are close to two different such segment, constituting a ridge in the terrain. 
Note that our Voronoi nodes are close to pieces from two different boundaries. 

An obvious limitation of our present approach is the requirement for high 
density of nodes. A promising avenue for overcoming this deficiency is to exploit 
higher-order information of the neighborhood structure, using more sophisticated 



(a) Sensor network topology. (b) Spatial distribution of |iV(v)|. 

Fig. 5. Example network consisting of 45,000 nodes. 




(a) Boundary nodes, near-boundary 
nodes, and Voronoi nodes. 

Fig. 6. Experimental ri 




(b) Structural loops along the 
boundaries. 

for the example network. 



geometric properties and algorithms. This should also allow the discovery and 
construction of more complex aspects of the network, e.g., for routing and energy 
management. 
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